ICA-based feature extraction for phoneme recognition
نویسندگان
چکیده
We propose a new scheme to reduce phase sensitivity in independent component analysis (ICA)-based feature extraction using an analytical description of the ICAadapted basis functions. Furthermore, since the basis functions are not shift invariant, we extend the method to include a spectral-domain ICA stage that removes redundant time shift information. The performance of the new scheme is evaluated for TIMIT phoneme recognition and compared with the standard mel frequency cepstral coefficient (MFCC) feature.
منابع مشابه
Phoneme recognition using ICA-based feature extraction and transformation
We investigate the use of independent component analysis (ICA) for speech feature extraction in speech recognition systems. Although initial research suggested that learning basis functions by ICA for encoding the speech signal in an e5cient manner improved recognition accuracy, we observe that this may be true for a recognition tasks with little training data. However, when compared in a large...
متن کاملSpectral Analysis of Speech: A New Technique
ICA which is generally used for blind source separation problem has been tested for feature extraction in Speech recognition system to replace the phoneme based approach of MFCC. Applying the Cepstral coefficients generated to ICA as preprocessing has developed a new signal processing approach. This gives much better results against MFCC and ICA separately, both for word and speaker recognition...
متن کاملIntegrated Phoneme Subspace Method for Speech Feature Extraction
Speech feature extraction has been a key focus in robust speech recognition research. In this work, we discuss data-driven linear feature transformations applied to feature vectors in the logarithmic mel-frequency filter bank domain. Transformations are based on principal component analysis (PCA), independent component analysis (ICA), and linear discriminant analysis (LDA). Furthermore, this pa...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملSpeaker Independent Isolated Tamil Words for Speech Recognition using MFCC, IPS and HMM
The process of converting an acoustic waveform into the text resembling the information, conveyed by the speaker is termed as speech recognition. Nowadays, normally Hidden Markov Model (HMM) based speech recognizer with Mel Frequency Cepstral Coefficient (MFCC) feature extraction is used. The proposed speech feature vector is generated by projecting an observed vector onto an Integrated Phoneme...
متن کامل